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A portion of the disclosure of this patent document contains command formats 
and other computer language listings, all of which are subject to copyright protection. 
The copyright owner, EMC Corporation, has no objection to the facsimile reproduction 
by anyone of the patent document or the patent disclosure, as it appears in the Patent and 
5 Trademark Office patent file or records, but otherwise reserves all copyright rights 
whatsoever. 

Field of the Invention 

The invention relates generally to managing data in a data storage environment, 
10 and more particularly to a system and method for managing replication of data distributed 
over one or more computer systems. 

Background of the Invention 

15 As is known in the art, computer systems which process and store large amounts 

of data typically include a one or more processors in communication with a shared data 
storage system in which the data is stored. The data storage system may include one or 
more storage devices, usually of a fairly robust nature and useful for storage spanning 
various temporal requirements, e.g. disk drives. The one or more processors perform 

20 their respective operations using the storage system. To minimize the chance of data 
loss, the computer systems also can include a backup storage system in communication 
with the primary processor and the data storage system. Often the connection between 
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the one or more processors and the backup storage system is through a network in which 

case the processor is sometimes referred to as a "backup client. " 

The backup storage system can include a backup storage device (such as tape 

storage or any other storage mechanism), together with a system for placing data into the 

5 storage device and recovering the data from that storage device. To perform a backup, 
the client copies data from the shared storage system across the network to the backup 
storage system. Thus, an actual data file may be communicated over the network to the 
backup storage device. 

The shared storage system corresponds to the actual physical storage. For the 

10 client to write the backup data over the network to the backup storage system, the client 
first converts the backup data into file data i.e., the client retrieves the data from the 
physical storage system level, and converts the data into application level format (e.g. a 
file) through a logical volume manager level, a file system level and the application level 
When the backup storage device receives the data file, the backup storage system can 

15 take the application level data file, and convert it to its appropriate file system level 

format for the backup storage system. The data can then be converted through the logical 
volume manager level and into physical storage. 

The EMC Data Manager (EDM) is capable of such backup and restore over a 
network, as described in numerous publications available from EMC of Hopkinton, MA, 

20 including the EDM User Guide (Network) "Basic EDM Product Manual". For 

performance improvements, a backup storage architecture in which a direct connection is 
established between the shared storage system and the backup storage system was 
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conceived. Such a system is described in U.S. Patent No. 6,047,294, assigned to assignee 
of the present invention, and entitled Logical Restore from a Physical Backup in 
Computer Storage System and herein incorporated by reference. 

Today much of the data processing and storage environment is dedicated to the 

5 needs of supporting and storing large databases, which only get larger. Although data 
storage systems, such as the EMC Symmetrix Integrated Cache Disk Array, and some of 
its supporting software such as TimeFinder have made general advancements in the data 
storage art through the advanced use of disk mirroring much of the capability of such 
technology is beyond the grasp of most entities. This is because of an ever-increasing 

10 shortage of skilled computer professionals. Typically, an entity such as a company might 
employ or contract a data storage administrator to take care of data storage needs, a 
database programmer to take of database needs and general network administrators and 
other information technology professionals to take care of general computing needs. 

If one of these skilled professionals leaves or is difficult to hire then the task of 

15 storing a database and taking care of its backup and restore needs may be neglected or 
never happen in the first place. What is needed is a computer-based tool, such as a 
system or program that could automate many of these tasks and reduce the complexity so 
that such a wide array or depth of skill sets are not needed. Further it would be an 
advantage if such a tool provided solutions for disaster recovery of data. 

20 Prior art systems have allowed for restoration of source or standard data from 

replicated copies, but there has been no straight-forward simple way to get logical 
information related to the source so that another computer could take over the role of a 
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failed computer (i.e., serve as a surrogate for the failed computer). There is a long-felt 

need for a technique to enable extraction of such logical information in a straight-forward 

non-complex and fast manner so that a surrogate computer could work with replicated 

copies in substantially the same manner as the original source computer that had operated 

5 with standard data. This would be advancement in the art with particular relevance in the 

field of disaster recovery. 

Summary of the Invention 

^ 10 The present invention is a system and method for management of data replicated 

% across one or more computer systems. 

jfi The method of this invention allows management of data that may be replicated 

D across one or more computer systems. The method includes the computer-executed steps 

1 11 of establishing one or more mirrored copies of data that are copies of one or more 

^3 15 volumes of data that are part of a first volume group on a first computer system. The 

mirrored copies of data are separated or split from the respective one or more volumes of 
data. Steps include the discovering of logical information related to the one or more 
volumes of data that are part of the volume group on the first computer system. 

A map is created from the discovered information to map logical information to 
20 physical devices on the first computer system. Then a duplicate of the one or more 

mirrored copies of data is mounted on the second computer system by using the map to 
create a second volume group that is substantially identical to the first volume group. 

5 



Patent Application 
Docket Number: EMC-00-067 
Applicant: Wright etal. 
EMC CONFIDENTIAL 

In an alternative embodiment, the invention includes a system for carrying out 

method steps. In another alterative embodiment, the invention includes a program 

product for carrying out method steps. 



5 Brief Description of the Drawings 

The above and further advantages of the present invention may be better under 

stood by referring to the following description taken into conjunction with the 

accompanying drawings in which: 
10 Fig. 1 is a block diagram of a data storage network including host computer 

systems a data storage system and a backup system and also including logic for enabling 

the method of the present invention; 

Fig.2 is an exemplary representation of a computer-readable medium encoded 

with the logic of Fig. 1 for enabling the method of the present invention; 
15 Fig. 3 is a schematic representation of the data storage network of Fig. 1 in which 

the invention may be configured to operate with standard and BC V devices for 

implementing the method of this invention; 

Fig. 4 is a representation of an embodiment of the logic of Fig. 1 and showing a 

preferred functional structure; 
20 Fig. 5 is a representation of a general overview of the method steps of this 

invention; 
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Fig. 6 is a flow logic diagram illustrating some method steps of the invention 

carried out by the logic of this invention that are generally part of the steps shown in Fig. 

5; 

Fig. 7 is another flow logic diagram illustrating some method steps of the 
5 invention carried out by the logic of this invention that are generally part of the steps 
shown in Fig. 5; 

Fig. 8 is a flow logic diagram illustrating some method steps of the invention 
carried out by the logic of this invention that are generally part of the steps shown in Fig. 
5; 

10 Fig. 9 is a flow logic diagram illustrating some method steps of the invention 

carried out by the logic of this invention that are generally part of the steps shown in Fig. 
5; 

Fig. 10 is a flow logic diagram illustrating some method steps of the invention 
carried out by the logic of this invention that are generally part of the steps shown in Fig. 
15 5; 

Fig. 1 1 is a flow logic diagram illustrating some method steps of the invention 
carried out by the logic of this invention which are generally part of the steps shown in 
Fig. 5; and 

Fig. 12 is a flow logic diagram illustrating some method steps of the invention 
20 carried out by the logic of this invention that are generally part of the steps shown in Fig. 
5; 
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Detailed Description of the Preferred Embodiment 

The methods and apparatus of the present invention are intended for use with data 

5 storage systems, such as the Symmetrix Integrated Cache Disk Array system available 
from EMC Corporation of Hopkinton, MA. Specifically, this invention is directed to 
methods and apparatus for use in systems of this type that include transferring a mirrored 
set of data from a standard device to a redundant device for use in applications such as 
backup or error recovery, but which is not limited to such applications. 

10 The methods and apparatus of this invention may take the form, at least partially, 

of program code (i.e., instructions) embodied in tangible media, such as floppy diskettes, 
CD-ROMs, hard drives, random access or read only-memory, or any other machine- 
readable storage medium. When the program code is loaded into and executed by a 
machine, such as a computer, the machine becomes an apparatus for practicing the 

15 invention. The methods and apparatus of the present invention may also be embodied in 
the form of program code that is transmitted over some transmission medium, such as 
over electrical wiring or cabling, through fiber optics, or via any other form of 
transmission. And may be implemented such that herein, when the program code is 
received and loaded into and executed by a machine, such as a computer, the machine 

20 becomes an apparatus for practicing the invention. When implemented on a general- 
purpose processor, the program code combines with the processor to provide a unique 
apparatus that operates analogously to specific logic circuits. 
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The logic for carrying out the method is embodied as part of the system described 
below beginning with reference to Figs. 1. One aspect of the invention is embodied as a 
method that is described below with detailed specificity in reference to Figs. 5-12. 

Data Storage Environment including Logic for this invention 

5 

Referring now to Fig. 1, reference is now made to a data storage network 100 in 
which the invention is particularly useful and includes a data storage systeml 19, host 
computer systems 1 13a and 1 13b, and backup system 200. 

In a preferred embodiment the data storage system is a Symmetrix Integrated 
10 Cache Disk Arrays available from EMC Corporation of Hopkinton, MA. Such a data 
storage system and its implementation is fully described in U.S. Patent 6,101,497 issued 
Aug. 8, 2000, and also in U.S. Patent 5,206,939 issued April 27, 1993, each of which is 
assigned to EMC the assignee of this invention and each of which is hereby incorporated 
by reference. Consequently, the following discussion makes only general references to 
15 the operation of such systems. 

The invention is useful in an environment wherein replicating to a local volume 
denoted as a business continuance volume (BCV) is employed (Fig 2). Such a local 
system which employs mirroring for allowing access to production volumes while 
performing backup is also described in the '497 patent incorporated herein. 
20 The data storage system 1 19 includes a system memory 1 14 and sets or pluralities 

115 and 1 16 of multiple data storage devices or data stores. The system memory 1 14 can 
comprise a buffer or cache memory; the storage devices in the pluralities 1 15 and 1 16 can 
comprise disk storage devices, optical storage devices and the like. However, in a 
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preferred embodiment the storage devices are disk storage devices. The sets 115 and 116 
represent an array of storage devices in any of a variety of known configurations. 

A host adapter (HA) 1 17 provides communications between the host system 1 13 
and the system memory 1 14; disk adapters (DA) 120 and 121 provide pathways between 

5 the system memory 1 14 and the storage device pluralities 1 15 and 1 16. A bus 122 
interconnects the system memory 1 14, the host adapters 1 17 and 118 and the disk 
adapters 120 and 121. Each system memory 1 14 and 141 is used by various elements 
within the respective systems to transfer information and interact between the respective 
host adapters and disk adapters. 

10 A backup storage system 200 is connected to the data storage system 119. The 

backup storage system is preferably an EMC Data Manager (EDM) connected to the data 
storage system as described in Symmetrix Connect User Guide, P/N 200-1 13-591, Rev. 
C, December 1997, available from EMC Corporation of Hopkinton, Mass. The direct 
connection between the shared storage system and the backup storage system may be 

15 provided as a high-speed data channel 123 such as a SCSI cable or one or more fiber- 
channel cables. In this system, a user may be permitted to backup data over the network 
or the direct connection. 

Backup system 200 includes a backup/restore server 202, Logic 206 as part of the 
server, and a tape library unit 204 that may include tape medium (not shown) and a 

20 robotic picker mechanism (also not shown) as is available on the preferred EDM system. 

Logic 206 is installed and becomes part of the EDM for carrying out the method 
of this invention and the EDM becomes at least part of a system for carrying out the 
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invention. Logic 206 is preferably embodied as software for carrying out the methods of 
this invention and is preferably included at least as part of a backup/restore server 202 in 
communication with the data storage system 1 19 through an adapter 132 (e.g., a SCSI 
adapter) along communication path 123. Substantially identical logic may also be 
5 installed as software on any host computer system such as 1 13a or 1 13b, shown as logic 
206a and 206b, respectively. In a preferred embodiment the software is Unix-based and 
daemons are launched by the software for execution where needed on the backup system, 
or host computers. The deamons on each of these computers communicate through 
sockets. 

10 The Logic of this invention, in a preferred embodiment is computer program code 

in the Perl programming language. As shown in Fig. 2, it may be carried out from a 
computer-readable medium such as CD-ROM 198 encoded with Logic 206 that acts in 
cooperation with normal computer electronic memory as is known in the art. Perl is a 
Unix-based language (see e.g. Programming Perl, 2nd Edition by Larry Wall, Randal I. 

15 Shwartz, and Tom Christiansen, published by O'Reilly and Associates). Nevertheless, 
one skilled in the computer arts will recognize that the logic, which may be implemented 
interchangeably as hardware or software may be implemented in various fashions in 
accordance with the teachings presented now. 

Generally speaking, the data storage system 1 19 operates in response to 

20 commands from one or more computer or host systems, such as the host systems 1 13a 
and 1 13b, that are each connected via a host adapter, such as host adapters 1 17 and 1 18. 
The host adapters 117 and 118 transfer commands to a command buffer that is part of 
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system memory 1 14. The command buffer stores data structures and write requests that 

the disk adapters generate. The disk adapters, such as the disk adapters 120 or 121, 

respond by effecting a corresponding operation using the information in a command 

buffer. The selected disk adapter then initiates a data operation. Reading operations 

5 transfer data from the storage devices to the system memory 1 14 through a corresponding 
disk adapter and subsequently transfer data from the system memory 1 14 to the 
corresponding host adapter, such as host adapter 1 17 when the host system 1 13a initiates 
the data writing operation. 

The computer systems 1 13a and 1 13b may be any conventional computing 

10 system, each having an operating system, such as a system available from Sun 

Microsystems, and running the Solaris operating system (a version of Unix), an HP 
system running HP-UX (a Hewlett-Packard client, running a Hewlett-Packard version of 
the Unix operating system) or an IBM system running the AIX operating system (an IBM 
version of Unix) or any other system with an associated operating system such as the 

15 WINDOWS NT operating system. 

A short description of concepts useful for understanding this invention and known 
in the art is now given. A physical disk is formatted into a "physical volume" for use by 
the management software such Logical Volume Manager (LVM) software available from 
EMC. Each physical volume is split up into discrete chunks, called physical partitions or 

20 physical extents. Physical volumes are combined into a "volume group" A volume 
group is thus a collection of disks, treated as one large storage area. A "logical volume" 
consists of some number of physical partitions/extents, allocated from a single volume 

12 
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group. A "filesy stern" is simply stated a structure or a collection of files. In Unix, 

filesystem can refer to two very distinct things: the directory tree or the arrangement of 

files on disk partitions. 

Below is a short description of other useful terminology which may be 

5 understood in more detail with reference to the incorporated '497 patent. When a 
mirror is "established" the data storage system 119 creates a mirror image (copy 
or replication) of a source or standard volume. When using the preferred 
Symmetrix such a mirror is denoted as a business continuance volume (BCV), 
also referred to in general terms as a mirrored disk, and in such a context 

10 specifically as a BCV device. If data on the standard volume changes, the same 
changes are immediately applied to the mirrored disk. When a mirror is "split" the 
preferred Symmetrix data storage system 119 isolates the mirrored version of the 
disk and no further changes are applied to the mirrored volume. After a split is 
complete, the primary disk can continue to change but the mirror maintains the 

15 point-in-time data that existed at the time of the split. 

Mirrors can be "synchronized" in either direction (i.e., from the BCV to 
the standard or visa versa). For example, changes from the standard volume that 
occurred after a split to the mirror can be applied to the BCV or mirrored disk. 
This brings the mirrored disk current with the standard. If you synchronize in the 

20 other direction you can make the primary disk match the mirror. This is often the 
final step during a restore. 
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The operation of a BCV device and its corresponding BCV volume or volumes is 
more readily understood in terms of data sets stored in logical volumes and is useful for 
understanding the present invention. As known, any given logical volume may be stored 
on a portion or all of one physical disk drive or on two or more disk drives. 

5 Referring to Fig. 3, in this particular embodiment, disk adapter 120 (Fig.l) 

controls the operations of a series of physical disks 1 15 that are shown in Fig. 3 in terms 
of logical volumes 212. The segmentation or hypering of physical disks into logical 
volumes is well known in the art. 

Similarly a disk adapter interfaces logical volumes 214 to the data storage system 

10 bus 122 (Fig. 1). Each of these volumes 214 is defined as a Business Continuation 

Volume and is designated a BCV device. The concept of BCV's are described in detail 
in the incorporated '497 patent so will be only generally descried herein. Each BCV 
device comprises a standard disk controller and related disk storage devices as shown in 
Fig. 1 especially configured to independently support applications and processes. The use 

15 of these BCV devices enables a host such as host 1 13a, described from here on as the 
"source" host computer system to utilize instantaneous copies of the data in the standard 
volumes 212. In a conventional operations there typically will be at least one BCV 
volume assigned to each host device that will operate on a data set concurrently. 
However, as will be explained below, this invention, in particular logic 206 and its 

20 counterparts 206a and 206b add additional function so that the BCV volumes established 
for use by one host may be used by another host, such as host 1 13b, described from here 
on as the "target" host computer system. 
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Although the invention has particular advantages when the target and source host 
computer system are separate distinct computers, there may also be advantages in having 
the two combined together. Thus, the target and source computer may be integrally 
combined as one computer. 

5 Referring again to Fig. 3, host 1 13a may continue online transaction processing 

(such as database transaction processing) or other processing without any impact or load 
on the volumes 212, while their respective mirror images on BCV's 214 are used to back 
up data in cooperation with backup system 200. However using the Logic of this 
invention the BCV's may be established for use on another host substantially 

10 automatically under control of a computer program, rather than requiring intervention of 
an operator all along the way. The advantages and details associated with such an 
operation are described below. 

The direction of data flow for backup is from the data storage system 119 to the 
backup system 200 as represented by arrow 211. The direction of data flow for restore is 

15 to the data storage system (opposite from arrow 211), but the BCV's may be mounted on 
another host other than the one originally established in accordance with the method of 
this invention. 

The EDM normally offers several options for controlling mirror behavior before 
and after a backup or restore, which are incorporated with this invention and are therefore 
20 discussed now at a high level. (Further detail about such known polices may be found in 
a white paper available from EMC: Robert Boudrie and David Dysert, EMC Best 
Practices: Symmetrix Connect and File Level Granularity.) 
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Pre- Backup Mirror Policy 
Bring Mirrors Down — This option expects to find the mirrors established and it 
will split the mirrors automatically before the backup. If the mirrors are down already, the 
backup will fail and report an error. The error is designed to prevent the system from 
5 backing up mirrors that are in an unexpected state. 

Verify Mirrors are Down — This option expects to find the mirrors split and 
it will leave them split and perform the backup. If the mirrors are established at the 
time of backup, the backup will fail and report an error. This error is designed to 
ensure that the backup is taken for the specific point in time that the mirrored data 
10 represents. 

Bring Mirrors Down if Needed — This option checks whether the mirrors are 
established or split and it will split the mirrors if they are established. If you select 
this option, the backup will not fail regardless of the state of the mirrors. 

Bring Mirrors Down after Establishing — This option checks the mirrors 
15 and if they are not established, the EDM first establishes the mirror to ensure that it 
is an exact copy of data on the primary volume. Then the EDM splits the mirrors to 
perform the backup. 

Post-Backup Mirror Policy 

20 During post-backup processing, mirror management can be configured to 

do any of the following: 

Bring Mirrors Up — After the restore is complete, the EDM automatically 
resynchronizes the mirror to the primary disk. 
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Leave Mirrors Down — After the restore is complete, the EDM leaves the 

mirrors split from the primary disk. 

Leave Mirrors as Found — After the restore is complete, the EDM 
resynchronizes the mirrors to the primary disk if they were established to begin 
5 with. If not, the EDM leaves the mirrors split. 

The invention includes a method for managing data that may be replicated across 
one or more computer systems. The method is carried out in the above-described 
environment by the Logic of this invention, which in a preferred embodiment is a 
program code in the Perl programming language as mentioned above. 
10 The method includes the computer-executed steps of establishing one or more 

mirrored copies of data (BCV's) that are copies of one or more volumes of data (Standard 
Volumes). The BCV's are established in a conventional manner as described in the 
incorporated '497 patent. The BCV's are separated or split from the respective one more 
volumes of data in a conventional manner and which is also described in the incorporated 
15 '497 patent. 

The Standard volumes are part of a volume group on the source computer system 
113a that has an operating system 210a (Fig.3). The operating system is preferably a 
Unix operating system, such as Solaris from Sun Microsystems of California, ADC from 
IBM of New York, or HP-UX from Hewlett Packard of California. 
20 The method further includes discovering logical information related to the 

Standard volumes that are part of the volume group on the source computer system 1 13a. 
A map of the logical information to physical devices on the source computer system is 
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created, preferably in the form of a flat file that may be converted into a tree structure for 
fast verification of the logical information. That map is used to build a substantially 
identical logical configuration on the target computer system 113b, preferably after the 
logical information has been verified by using a tree structure configuration of the logical 
5 information. 

The logical configuration is used to mount a duplicate of the BCV's on the target 
computer system (denoted as mounted target BCV's). The newly mounted target BCV's 
then become part of a second volume group on the target computer system 1 13b that has 
an operating system 210b. The operating system is preferably a Unix operating system, 
10 such as Solaris from Sun Microsystems of California, ADC from IBM of New York, or 
HP-UX from Hewlett Packard of California. 

The invention is particularly useful when data on the standard volumes and BCV's 
represents data related to an application 208a and/or application 208b, and in particular 
the invention is particularly useful if the application is a database, such as an Oracle 
15 database available from Oracle Corporation of Redwood, California. 

Referring to Fig. 4, the logic 206 includes program code that enables certain 
functions and may be thought of as code modules, although the code may or may not be 
actually structured or compartmentalized in modular form, i.e., this illustrated concept is 
more logical than physical. Accordingly, D/M module 300 serves a discovery/mapping 
20 function; E/S module 302 serves an establish/split function; B/M module 304 serves a 
build/mount function; B/R module 306 serves a backup/restore function; and D/C module 
308 serves a dismount/cleanup function. Any of the functions may be accomplished by 
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calling a procedure for running such a function as part of the data storage system and the 

backup system. 

The discovery/mapping function, discovers and maps logical to physical devices 
on the source host 1 13a, and includes such information as physical and logical volumes, 
5 volume groups, and file system information. The establish/split function establishes 
BCV's or splits such from standard volumes, depending on the pre- and post- mirror 
policies in effect on source host 113a. 

The build/mount function substantially exports the BCV's established on the 
source host 1 13a to the target host 1 13b. It creates volume group, logical volume, and 
10 file system objects on the target host computer system. 

The backup/restore function performs backup of the target host BCV data that has 
been exported or migrated from the source host. The dismount/cleanup function removes 
all volume group, logical volume, and filesystem objects from the target host. 



15 Method Steps of the Invention 

Now for a better understanding of the method steps of this invention the steps are 
described in detail with reference to Figs. 5-12. 

Fig. 5 shows an overview of the entire process. In step 400 the logic 206 maps 
20 logical to physical devices on the source host. In a step 402, the logic establishes or splits 
standard to BCV's (which may be accomplished by a call to another function on the data 
storage system) in accordance with the mirror policy in effect at the source host. Step 
404, logic builds and mounts on the target host so that the BCV's are exported or 
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migrated to the target host. Step 406 is a step for Backup and/or Restore, as described in 
more detail below. Step 408 is a cleanup step in which all volume group logical volume, 
and filesystem objects are removed from the target server. 

Fig. 6 is an overview of the steps of the mapping and discovery process. Such 
processing begins in step 500. The filesystem is discovered on the source host in step 
502. The logical volume is discovered in step 504. The volume group information is 
discovered on the source host in step 506. In step 508, the map is created preferably as a 
flat file because that is an efficient data structure for compiling and using the information. 

For mapping purposes, in general, the method uses a data storage system input 
file. Preferably the input file is a three-column file that contains a list of the standard and 
BCV device serial numbers containing the data and the data copies respectively, and the 
physical address of the BCV devices. 
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The following is an example of this file: 
Table 1 : Example of data storage system input file 



Standard (STD) Device 


BCV Dev 


7902C000 


790B4000 


7902D000 


790B5000 


7902E000 


790B6000 


7902F00 


790B7000 



5 

An example of how such a map is created in the preferred embodiment for each of 
the preferred operating systems: Solaris, AIX, and HP-UX is now shown in tables 2-4, in 
the respective order mentioned. 



10 Table 2: Mapping information for Sun Solaris: 



Mapping file (for SUN Solaris) 

The .std Mapping file is generated by a Unix-based call with the -std option flag. The .std Mapping 
file is a multi-columned file of information about the Standard devices 

15 

The columns may include: 

1 . Device Serial Number-from the Data Storage system input file 

2. Physical Address (i.e., cOdOtl) 
20 3. Volume Group 

4. Logical Volume Name 

5. File Type 

6. Mount Point 

7. Serial Number 
25 8. Device Type 
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The following is an example of a flat file using such information for a Solaris 

operating system: 



3701F000 clt0d0s2 testvgl volOl ufs/mirl 947015961. 1105.sunmir2 sliced 
5 3701F000 clt0d0s2 testvgl vol02 ufs /mir2 947015961. 11 05. sunmir2 sliced 
37020000 clt0dls2 testvgl volOl ufs /mirl 947015961.1 105. sunmir2 sliced 
37020000 clt0dls2 testvgl vol02 ufs /mir2 947015961.1 105.sunmir2 sliced 



Table 3: Mapping information for IBM AIX: 



10 Mapping file (for IBM AIX) 

The .std Mapping file is generated by a Unix-based call with the -std option flag. The .std Mapping 
file is a multi-columned file of information about the Standard devices 

The columns may include: 

15 

1 . Device Serial Number-from the Data Storage System 

2. Physical Address (i.e., hdiskl) 

3. Volume Group 

4. Logical Volume Name 

20 5. Volume Group Partition Size 

6. File Type 

7. Mount Point 

8. Logical Volume Partition size 

9. Logical Volume source journal log 

25 10. Logical Volume number of devices striped over 
1 1 . Logical Volume Stripe size 



The following is an example of a flat file using such information for an AIX 
operating system: 

30 37006000 hdiskl testvg2-2 testvg2-lv01 4 jfs /testvg2/mntptl 25 loglv02 N/A N/A 

37006000 hdiskl testvg2-2 testvg2-lv02 4 jfs /testvg2/mntptl/mntpt2 25 loglv02 N/A N/A 
37006000 hdiskl testvg2-2 testvg2-lv03 4 jfs /testvg2-3 25 loglv02 N/A N/A 
37006000 hdiskl testvg2-2 testvg2-lv04 4 jfs /testvg2-4 25 loglv02 N/A N/A 
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Table 4: Mapping information for HP-UX: 



10 



15 



Mapping file (for HP-UX) 

The .std Mapping file is generated by a Unix-based call with the -std option flag. The .std Mapping 
file is a multi-columned file of information about the Standard devices 

The columns may include: 

1 . Device Serial Number-from the data storage system input file 

2. Physical Address (i.e., cOdOtl) 

3. Volume Group 

4. Logical Volume Name 

5. Logical Volume Number 

6. File Type 

7. Mount Point 



The following is an example of a flat file using such information for an AIX 
20 operating system: 

7903A000 c3t8d2 vgedm2 Ivt8d2 1 vxfs /t8d2 
7903B000 c3t8d3 vgedm2 Ivt8d3 2 vxfs /t8d3 
7903C000 c3t8d4 vgedm2 Ivt8d4 3 vxfs /t8d4 
7903D000 c3t8d5 vgedm2 Ivt8d5 4 vxfs /t8d5 

25 Referring now to Fig. 7, step 600 uses the flat file to create a tree structure. This 

structure is preferably built by a unix function call from information in the mapping files 
described above. It may be built on both the target host computer system and the source 
host computer system. It is referred to as a tree because the Volume group information 
may be placed as the root of the tree and the branches represent the device information 

30 within the group and the logical volumes within the group. It is used in step 602 to verify 
the accuracy of the map file before the map file is sent to the target host. The tree is 
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converted to a map preferably as a flat file in step 604. This flat file map is then sent 

back to the target in step 606. 

Referring to Fig. 8, the process of establishing/splitting with a backup system is 

started in step 700. The mirror policy is checked in step 702. An inquiry is posed in step 
5 704 to determine if BCV's are established in accordance with the mirror policy. If the 

answer is no then BCV's are established in step 706. The BCV's are split from the source 

host in step 708. The BCV's are made not ready to the host in step 710. 

Referring to Fig. 9, the process of beginning to build/mount logical information so 

the BCV's can be mounted on the target is begun in step 800. The volume groups are 
10 created on the target is step 802. Logical volumes are created on the target in step 804. 

The filesystem is created on the target in step 806. The device mount may now be 

completed with this logical information related to the BCV's on the target host in step 

808. 

Referring to Fig. 10, the newly mounted target BCV's may now be backed up in 
15 step 900. The application is then shut down on the target in step 902. And following the 
backup of the target BCV's cleanup steps as described in Fig. 12 and notification take 
place in step 904. 

If the software application on the target host in the source host is a database, then 
information related to the data may also be backed up, with the effect that essentially the 
20 entire database is backed up. Important information from the database includes any 
transactional data performed by the database operations, and related control files, table 
spaces, and archives/redo logs. 
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Regarding databases, these are other terminology are now discussed. The 

terminology is described with reference to an Oracle database because that is the 

preferred embodiment but one skilled in the art will recognize that other databases may 

be used with this invention. 

5 Control files contain important information in the Oracle database, including 

information that describes the instance where the datafiles and log files reside. Datafiles 
may be files on the operating system filesystem. A related term is tablespace that is the 
lowest logical layer of the Oracle data structure. The tablespace consists of one or more 
datafiles. The tablespace is important in that it provides the finest granularity for laying 

10 out data across datafiles. 

In the database there are archive files known as redo log files or simply as the 
redo log. This is where information that will be used in a restore operation is kept. 
Without the redo log files a system failure would render the data unrecoverable. When a 
log switch occurs, the log records in the filled redo log file are copied to an archive log 

15 file if archiving is enabled. 

Referring now to Fig. 1 1, the process for restoring source standard volumes is 
shown beginning at step 1000. Step 1002, poses an inquiry to determine if the restore is 
to be from the BCV's on the target or tape. In accordance with the answer the standard 
volumes are synchronized or restored from the target mounted BCV's or tape, 

20 respectively in steps 1004 or 1006. Step 1008 begins the notification and cleanup steps 
that are generally described in Fig. 12. 
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The cleanup/dismount process begins in step in 1 100 as shown in Fig. 12. The 

BCV's are dismount from the target in step 1 102. This may be accomplished for example 

with the UNIX umount command. The objects related to volume group, logical volume, 

and filesystem or move the target in steps 1 104 and 1 106. The cleanup is completed in 

5 step 1 108. The BCV's are re-established on the source (i.e., made ready to the host) in 

step in 1108. 

A system and method has been described for managing data that may be 
replicated across one or more computer systems. Having described a preferred 
embodiment of the present invention, it may occur to skilled artisans to incorporate these 
10 concepts into other embodiments. Nevertheless, this invention should not be limited to the 
disclosed embodiment, but rather only by the spirit and scope of the following claims and 
their equivalents. 
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